On Laboratory Testing of Text Retrieval Systems
نویسنده
چکیده
The 45-year history of information retrieval evaluation includes a range of experiments which might be categorised in the light of the tradition division between in vitro and in vivo experiments in biology. In this talk, I will explore some of the characteristics of laboratory or in vitro experiments, and contrast them with operational system or in vivo experiments. The present state of the field shows a clear predominance of the laboratory approach, characterised by TREC, NTCIR and various other similar endeavours. However, there are research questions that can only be answered in an operational environment; the two approaches are complementary. It is in any case not a simple division: it is more like a spectrum, one end represented by complete experimental control and the other by complete realism. Actually the extremes are neither possible nor interesting all real experiments involve some degree of compromise between the two. In particular, the trend in TREC towards a wider variety of tasks reflects in part various attempts to introduce at least some of the conditions of real-world systems. There is always conflict between the requirements of laboratory control and those of realism; compromise is both difficult and necessary.
منابع مشابه
Reducing Retrieval Time in Automated Storage and Retrieval System with a Gravitational Conveyor Based on Multi-Agent Systems
The main objective of this study is to reduce the retrieval time of a list of products by choosing the best combination of storage and retrieval rules at any time. This is why we start by implementing some storage rules in an Automated Storage/Retrieval System (Automated Storage and Retrieval System: AS/RS) fitted with a gravity conveyor while some of these rules are dedicated to storage and ot...
متن کاملImage retrieval using the combination of text-based and content-based algorithms
Image retrieval is an important research field which has received great attention in the last decades. In this paper, we present an approach for the image retrieval based on the combination of text-based and content-based features. For text-based features, keywords and for content-based features, color and texture features have been used. Query in this system contains some keywords and an input...
متن کاملUsing Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents
Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...
متن کاملUsing Text Surrounding Method to Enhance Retrieval of Online Images by Google Search Engine
Purpose: the current research aimed to compare the effectiveness of various tags and codes for retrieving images from the Google. Design/methodology: selected images with different characteristics in a registered domain were carefully studied. The exception was that special conceptual features have been apportioned for each group of images separately. In this regard, each group image surr...
متن کاملUsing Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents
Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...
متن کامل